Compositional Data Analysis

نویسندگان

چکیده

Compositional data are nonnegative carrying relative, rather than absolute, information—these often with a constant-sum constraint on the sample values, for example, proportions or percentages summing to 1% 100%, respectively. Ratios between components of composition important since they unaffected by particular set chosen. Logarithms ratios (logratios) fundamental transformation in ratio approach compositional analysis—all thus need be strictly positive, so that zero values present major problem. Components group together based domain knowledge can amalgamated (i.e., summed) create new components, and this alleviate problem zeros. Once transformed logratios, regular univariate multivariate statistical analysis performed, such as dimension reduction clustering, well modeling. Alternative methodologies come close ideals logratio also considered, especially those avoid zeros, which is particularly acute large bioinformatic sets.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Correlation Analysis for Compositional Data

Compositional data need a special treatment prior to correlation analysis. In this paper we argue why standard transformations for compositional data are not suitable for computing correlations, and why the use of raw or log-transformed data is neither meaningful. As a solution, a procedure based on balances is outlined, leading to sensible correlation measures. The construction of the balances...

متن کامل

Robust factor analysis for compositional data

Factor analysis as a dimension reduction technique is widely used with compositional data. Using the method for raw data or for improperly transformed data will, however, lead to biased results and consequently to misleading interpretations. Although some procedures, suitable for factor analysis with compositional data, were already developed, they require pre-knowledge of variable groups, or a...

متن کامل

Lecture Notes on Compositional Data Analysis

Preface These notes have been prepared as support to a short course on compositional data analysis. Their aim is to transmit the basic concepts and skills for simple applications, thus setting the premises for more advanced projects. One should be aware that frequent updates will be required in the near future, as the theory presented here is a field of active research. The notes are based both...

متن کامل

Clustering compositional data trajectories

This work is motivated by the following question: given a sample of compositional data trajectories (i.e. sequences of composition measurements along a domain), how can one propose a segmentation procedure leading to homogeneous classes? In other words, our contribution aims at studying statistical methods suited for clustering compositional data, when the observations are constituted by trajec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Annual review of statistics and its application

سال: 2021

ISSN: ['2326-8298', '2326-831X']

DOI: https://doi.org/10.1146/annurev-statistics-042720-124436